SEQUENCE LISTING 



<110> XIANG, JINHUA 

WUNSCHMANN , SABINA 
SCHMIDT, WARREN 
STAPLETON, JACK T. 

<120> FULL-LENGTH GB VIRUS C (HEPATITIS G VIRUS) RNA TRANSCRIPTS ARE INFECTIOUS 
IN PRIMARY CD4 POSITIVE T CELLS 

<130> IOWA:030US 

<140> UNKNOWN 
<141> 2001-04-05 

<140> 60/253,390 
<141> 2000-11-27 

<150> 60/195,597 
<151> 2000-04-06 



* <160> 2 

y <170> Patentln Ver. 2.1 

fl <210> 1 

S <211> 9395 

J <212> DNA 

f| <213> Hepatitis G virus 



<400> 1 

tgacgtgggg 

ctatctaagt 

gggtggtgac 

gtcttaagag 

gtgctggccc 

gtcacccacc 

tgaccaatag 

aggaccccaa 

cggcggcctg 

ttttctatac 

tggccccggc 

ccccggaaga 

tttgcaccga 

ccgcggccca 

acgtagccgg 

cgttgaggcg 

agtggggaag 

aatacctttg 

tggtttgggt 

tgacgatggc 

actacgggtt 

gggagagggt 

gggtttgggt 

gccacggaca 

taacgtgcgt 

tcgatgtgtg 



gggttgatcc 
agacgcaatg 
agggttggta 
aaggtcaaga 
taccggtgtg 
tgggcaaacg 
gtttatccgg 
accctgccct 
cagccggggt 
catcatggca 
cacccacgct 
catcgggttc 
ccgttgctgg 
gctcgttggg 
gatcctgggt 
ccgggtctac 
tgagttttgg 
gaaagtccca 
ggccgcattg 
ggggatgttg 
gaagtggcag 
gtgggatcga 
cccggccttt 
aaaccagtgg 
gtggggttcc 
gagtttggtg 



cccccccccg 
actcggcgcc 
ggtcgtaaat 
ctcctcttgt 
aataagggcc 
acgcccacgt 
cgagttgaca 
tcccggtggg 
agcccaagaa 
gtccttctgc 
tgtcgagcga 
tgcctggaag 
ccactgtatc 
gaactgggga 
ctgggcgagg 
ctgatgccca 
agatggactg 
tttgaatttt 
cttttgctgg 
caaggcgccc 
tcatgctcct 
gggaatgtca 
tgccaggcgg 
cccctatcat 
gtgtcttggt 
ccggttggat 



gcactgggtg 
gactcggcga 
cccggtcatc 
gcctgcggcg 
cgacgtcagg 
acggtccacg 
aggaccagtg 
ccgggaaatg 
tccttcgggt 
tccttctcgt 
atgggcaata 
gcggatgcct 
aggcgggttt 
gcctgtacgg 
tttactccgg 
acctgaagtg 
agcagttggc 
ggagaggagt 
agcaacggat 
ccgcctccgt 
gcagggctaa 
cgctcttgtg 
ttgggtgggg 
gcccccaata 
ttgcctcgac 
ctgccagctg 



caagccccat 
ccggccaaaa 
ctggtagcca 
agaccgcgca 
ctcgtcgtta 
tcgcccttca 
ggggccgggg 
catggggcca 
gagggcgggt 
ggttgaggcc 
tttcctcaca 
ggtggccctg 
ggctgtgcgg 
gcccttgtcg 
ggtcctgaca 
tgcagtagaa 
ctccaattac 
gatgagcctg 
tgtcatggtt 
tttggggtcc 
cgggtcgcgt 
tgactgcccc 
cgaccccatc 
tgtctatggg 
cggcggtcgt 
caccatagcc 



aaaccgacgc 
ggtggtggat 
ctataggtgg 
cggtccacag 
aaccgagccc 
atgtctctct 
gttatgggga 
cccagctccg 
ggcatttctc 
ggggccattc 
aattgctgtg 
gggtgcacgg 
cctggcaagt 
gtctcggctt 
gttggtgttg 
tgtgacgtta 
tggattttgg 
acccctctgt 
ttcctgctgg 
cgcccctttg 
attcccactg 
aacggcccct 
acccattgga 
tctgtgtccg 
gattcgaaga 
gctctagggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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catcggatcg cgacacggtg gttgagctct ccgagtgggg agtcccgtgc gtaacgtgta 1620 

ttctggaccg tcggcctgct tcatgtggca cctgtgtgcg ggactgctgg cccgaaaccg 1680 

ggtcggttag attccctttc catcggtgcg gcacggggcc tcggctgaca aaggacttgg 1740 

aagctgtgcc cttcgtcaac aggacaactc ccttcaccat aaggggcccc ctgggcaacc 1800 

aggggagagg caacccggtg cggtcgcccc tgggttttgg gtcctacacc atgaccaaga 1860 

tccgggattc cctgcatttg gtgaaatgtc ccacaccagc catagagcct ccgactggaa 1920 

cgttcgggtt cttccccgga gtcccgccca ttaacaactg catgccgcta ggcacggaag 1980 

tgtctgaggc attgggcgga gctgggctta cgggggggtt ctacgagcct ctggttcgca 2040 

ggtgttcgga gctgatggga cgccgaaatc cggtttgccc ggggtacgca tggctgtcct 2100 

ctggtagacc tgacgggttc atacacgtcc aggggcacct gcaggaggtg gatgcgggca 2160 

acttcatccc tcctccacgc tggttgctct tggattttgt atttgtcctg ctctatctga 2220 

tgaagctggc tgaggcacgg ttggtcccgt tgatcttgct tctgctgtgg tggtgggtga 2280 

accagttggc ggttctagga ctgccggctg tggacgctgc cgtggcgggt gaagtttttg 2340 

cgggccctgc cttgtcatgg tgtttgggcc ttcccactgt cagtatgata ctaggtctag 2400 

caaacctggt gttgtacttt cggtggatgg gccctcagcg cctcatgttc ctcgtgttgt 2460 

ggaagctcgc tcggggagct ttcccgctgg cacttttgat ggggatttcg gcgacccgcg 2520 

ggcgcacctc tgtgctcggg gccgagttct gcttcgatgt cacattcgag gtggacactt 2580 

cggtgttggg ctgggtggtg gccagcgtgg tggcttgggc catagcgctc ctgagctcaa 2640 

tgagcgcagg ggggtggaag cacaaggccg tgatctatag gacgtggtgt aaagggtacc 2700 

aggctgtgcg ccagagggtg gtgcggagcc ccctcgggga ggggcgtcct accaagcttc 2760 

? a tgacgttcgc ctggtgcttg gcctcataca tctggccgga tgctgtgatg atggtggtgg 2820 

tggccttggt cctcctcttc ggcctgttcg acgcactgga ctgggccctg gaggagctcc 2880 

tggtctcccg gccctcgtta cggcgactgg cacgggtggt tgagtgctgt gtgatggcgg 2940 

^ gcgagaaggc caccaccatc cgactggtct ccaagatgtg cgcaagaggg gcctacctgt 3000 

FiJ ttgaccacat gggctctttc tcgcgcgctg tcaaggagcg cttgttggaa tgggacgcgg 3060 

E0 ctttggagcc cttgtcattc actaggacgg actgtcgcat catcagagat gccgcgagga 3120 

S S ccctgtcctg cggacagtgc gtcatgggtt tacccgtggt agcacggcgc ggtgatgagg 3180 

■uj ttctcatcgg cgtctttcag gatgtgaatc atttgcctcc cgggtttgtc ccgactgcac 3240 

fq cagttgtcat ccgtcggtgc ggaaagggct tcctgggggt cacgaaggca gccttgacag 3300 

I gtagggatcc tgacttacat ccagggaacg tcatggtgtt ggggacggct acgtcacgaa 3360 

gcatgggcac atgtctgaat ggcctgctgt tcacaacttt ccatggggct tcatcccgaa 3420 

"~ ccatcgccac gcccgtgggg gcccttaatc ccaggtggtg gtcagccagt gatgacgtca 3480 

cggtgtaccc gcttccagat ggggcaactt cgttgacgcc ctgcacttgc caggcggagt 354 0 

^ cctgttgggt tattagatcc gacggggctt tgtgccatgg cttgagcaag ggggacaagg 3600 

Ml ttgagctgga tgtggccatg gaggtctctg acttccgtgg ttcgtctggt tcaccggtcc 3660 

£3 tttgcgacaa agggcacgca gtaagaatgc tcgtgtcagt gctccactct ggcggcaggg 3720 

[:» ttactgcggc gcgattcact aggccgtgga ctcaagtacc aacagatgcc aagactacca 3780 

cagaaccccc tccggtgccg gcaaaaggag ttttcaagga ggccccgttg tttatgccta 3840 

cgggggcggg aaagagcacc cgcgtaccgt tggagtacgg caacatgggc cacaaggtct 3900 

tgatcttgaa cccgtcggta gctaccgtga gggccatggg cccatacatg gagcggctgg 3960 

cggggaaaca ccccagtatt tactgtggcc atgacaccac tgctttcaca aggatcactg 4020 

actcgcccct tacgtattcc acttacggaa ggtttttggc caaccctagg cagatgctga 4 080 

ggggtgtgtc ggtggtcatt tgtgacgagt gccacagtca tgactcaact gtgttgttgg 4140 

gcattgggcg tgtcagggag ctggcgcgag gatgtggagt gcaattggtg ctctacgcca 4200 

ctgccacccc tcccggatcc ccgatgaccc agcacccatc aatcattgag acaaaactgg 4260 

acgtgggaga gatccccttc tatgggcatg gcatacctct tgagcggatg cggaccggaa 4320 

ggcatctcgt attctgccac tccaaggctg agtgcgagcg cctggcgggc cagttttcgg 4380 

ctaggggggt aaatgccatc gcctattaca gggggaaaga cagttctatc atcaaagatg 4440 

gagacctggt ggtgtgtgct acagacgcac tatccactgg gtacactggg aacttcgatt 4500 

ctgtcaccga ttgtgggtta gtggtggagg aggtcgtcga ggtgaccctt gatcccacca 4560 

ttaccatctc cctgcgcacg gtgcccgcgt cggctgaact gtcgatgcag cggcgaggac 4620 

gcacgggtag gggcaggtct gggcgctact actacgcggg ggtcggcaag gcccctgctg 4680 

gtgtggtgcg ctcaggtcct gtctggtcgg cggtggaagc cggtgtgacc tggtacggaa 474 0 

tggaacctga cctgacagca aacctactga gactttacga caactgccct tacaccgcag 4800 

ccgtcgcagc tgacattggg gaagccgcgg tgttcttttc ggggcttgcc ccgttgagga 4860 

tgcatcccga tgttagctgg gcaaaagttc gcggcgtcaa ctggcccttc ctggtgggtg 4 920 

ttcagcggac catgtgccgg gaaacactgt ctcccggccc atcggatgac ccccagtggg 4980 
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caggtctgaa 
ctaaagtggc 
gttacgtccg 
tgatctatgc 
gtggcagccc 
ccccggtaga 
atgcggtggc 
aagtgctgtc 
ttgctggctg 
tcgccggggg 
cggcctatgg 
tgggcgtcgg 
ggaccgctct 
ctagcgtctc 
tggtgaatgc 
tgtggtatgc 
ggttggtgtt 
ctacattgcc 
acaaggtctc 
acagggagcc 
tcatgcgtca 
tacccttatg 
agagtcgttg 
atccagttta 
tgctgggcta 
tcgggacgtc 
catccgccta 
acgtcgacgg 
gccctgggca 
ttaggaatgt 
ctgaagactc 
tcgagaatgc 
caccctctct 
ctccatcgcc 
tgtcttcctc 
agacagccga 
ttccacagag 
gcgtaacacg 
tggaaatcca 
ttgggtgctt 
agaccttggc 
ccaagccccc 
atgtcaccaa 
ctagggttca 
cctgcctaag 
ccatgggctg 
ctgtccatga 
tgaaaaagga 
tcccccccct 
ccaaggcggt 
agatgctcaa 
gcttcgacag 
cttcagacca 
taacccccga 
ccagtgcgag 
ggctgaaaaa 
ctgtgtgcga 



gggcccgaat 
cggccatcac 
ctgcgatgcg 
gtcatacacc 
cctttatcgg 
ccatcggccg 
ggccatccag 
cttggcccag 
ctacacgggg 
ctgggcggcg 
ggcttctagg 
aggcaacgcg 
gggcacgcct 
cccctccttg 
ggctagcctt 
catcccagtg 
gtactcagct 
aaggtcctca 
agctgtgctc 
taaggtggat 
agtgcgcatg 
gcactgcggg 
tctttgtggt 
ctctaccaag 
tggcgagacg 
tgggtgggct 
caaactgctg 
cataccggtc 
aagtgtcacc 
ggcgccctct 
agaactgact 
tgcgagaatt 
ttgtgggagt 
agcacttatc 
gcaggaggat 
aggggaggaa 
cgatgccaca 
cttcttttca 
gaaccataca 
ggtgggcaat 
ttccttctct 
tgtggtgagg 
cccggacaat 
tgacaaattc 
catgggttac 
gggatctaag 
ccggctccag 
agtgttcttc 
ggacttccgg 
gttggggggg 
actgtgggaa 
tagcataact 
tccagaatgg 
gggggtgcca 
caactgcttg 
tgtctcgctc 
tcctagcgac 



cctgtcccac 
atcgtggacg 
ggacccatct 
gggtctctcg 
catggagacc 
gggggagagt 
gtggattgcg 
gctaagacgg 
acgcgggccg 
gtggtaggcc 
agccctccat 
caaacccgct 
gtcgtggggt 
gtcaccattt 
gtcttcgact 
ctaaccagtc 
aacaactctg 
tgcatccctg 
cgacgcttga 
gaggttcagg 
gtgatggcca 
gaggggtggt 
tgcgtgatca 
ctgtgcaggc 
tcgcctttgc 
gaggtggtgg 
cgccagcaaa 
tcatgggacg 
attgacgggg 
gaggtgtcat 
gaggccgacc 
cttgaacctc 
agccgagaga 
tcggttactg 
accccgtctt 
agcgtcttca 
agaaagctta 
ttgggattga 
gcctattgtg 
gaacttacct 
tacatttggt 
ccggttggct 
gttgggagaa 
ctcgtggact 
acttatgagg 
gtgtcggtca 
gagatacttg 
aaagaccgaa 
atagctgaaa 
gcctacgcct 
tcaaagaaga 
gaagaggacg 
gtgcgtgccc 
gtgggtgaga 
acttgctata 
ctcatcgctg 
gctttgggca 



tcctgctgag 
acctggtccg 
tgatggtggg 
tggtggttac 
aggccacgcc 
ctgcgccatc 
attggtcagt 
ccgaggccta 
tccccactgt 
attgccacag 
tggctgctgc 
tagcctccgc 
taaccatggc 
tactgggggc 
ttatggcggg 
cgggggcagg 
gcactaccac 
acagttactt 
gcctcactcg 
tggggtacgt 
gacttcgggc 
ccggagaatg 
ccggtgatgt 
attattggat 
tcgcctcaga 
tgacccctac 
tcctgtcggc 
cggacgcgcg 
aacgctacac 
ccgaggtgtc 
tgccgccggc 
acatagatgt 
tgcctgtgtg 
agagcagccc 
ctgactcatt 
acgtggctct 
ccgttaagat 
cggtcgctga 
acaaggtgcg 
ttgaatgtga 
ctggggtgcc 
ccttgctggt 
gagttgacaa 
ccatagagcg 
aggcaataag 
aggacctcgc 
aagggacgcc 
aggaagagaa 
agcttattct 
tccagtacac 
caccatgcgc 
tggcgctgga 
tggggaaata 
ggtattgtag 
tcaaggtgaa 
gcgatgactg 
gagccctggc 



gtggggcaat 
taggctcggg 
cctcgctatt 
agactgggat 
ccagccggtt 
ggatgccaac 
catgaccctg 
cgcagctacc 
ttcaattgtt 
tgtaatagct 
cgcttcctac 
tctcctacta 
gggcgcgttc 
cgtggggggc 
gaaactatca 
acttgcgggg 
ttggttgaac 
tcagcaggcc 
caccgtggtt 
ctgggacttg 
cctctgcccc 
gttgttggac 
tttgaatggg 
ggggacagtc 
caccccgaag 
ccacgttgtg 
tgctgttgct 
agcgcctgcc 
ccttccgcat 
cattgacatt 
ggctgcagcc 
catcatggaa 
gggagaagac 
agatgagaag 
cgaggtcatc 
ttccgtacta 
gtcatgctgt 
cgtggcaagc 
cactccgctt 
caagtgtgag 
actgacgagg 
ggccgacacc 
ggttaccttc 
cgctaagagg 
gactgtaagg 
cacccctgcg 
agtccccttt 
ggccccccgc 
gggagaccct 
cccaaatcag 
catctgtgtg 
gacagagctt 
ctatgcctct 
atcctcaggg 
agccgcctgt 
tttgatcata 
gagctacggg 



gatttaccat 
gtggcggagg 
gcggggggca 
gtgaaggggg 
gtgcaggtcc 
acagtgacag 
tcgatcgggg 
accaagtggc 
gacaagctct 
gcggcagtgg 
ctcatggggt 
ggggccgctg 
atgggaagtg 
tgggagggcg 
tcagaagatc 
atcgccctcg 
cgtctgctga 
gattactgtg 
gccctggtca 
tgggagtgga 
gtggtgtcat 
ggccatgttg 
caactcaaag 
cctgtgaaca 
gtggtaccat 
atcaggcgaa 
gagccctatt 
atggtctatg 
caactgcggc 
gggacggaga 
cttcaggcta 
gattgcagta 
ataccccgca 
accccgtcgg 
caagagtccg 
aaagccttgt 
gttgagaaga 
ctgtgtgaga 
gaattgcagg 
gctaggcaag 
gccactccgg 
accaaggtgt 
tggcgtgccc 
gcagctcaag 
ccacatgctg 
gggaagatgg 
actcttactg 
ctcattgtgt 
ggacgggtag 
cgaattaggg 
gacgccacat 
tatgccctgg 
ggcacaatgg 
gtcttgacca 
gagagggtgg 
tgcgaacggc 
tacgcatgcg 



5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
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agccttcgta 
gcaatgcaga 
gcatgtcgag 
cttggcatcc 
gtggtggcac 
cactggacaa 
ccgcagacac 
ctggcctagc 
gcggttgggc 
cggagattgc 
atcaattgga 
tcatcgtagc 
tcatcactgg 
ggcccggcct 
gctaatgcac 
gctgctcgca 
ccggtaagag 



tcatgcatca 
tgggaaacgc 
cgagtacagt 
tatcacacgg 
accgtctgat 
actgcctaac 
aactaagaca 
agtccaccgg 
tgagttggct 
tggtatcccc 
tttcacaagc 
cctcttcggg 
aggaggttcc 
tgggaggcat 
tgccacttcg 
gagccctccc 
ctcggcccaa 



ctggacacgg 
catttcttcc 
gacccaatgg 
tgggtcatca 
cctgtgtggt 
atcatcgtgg 
aaaatggagg 
aagaaggccg 
agggggctgt 
gggggtttcc 
cagaggagtc 
tgaactaaat 
cgccctcccc 
ggtggttact 
gtggcgggtc 
cggatggggc 
aggccgggtt 



cccccttctg 
tgaccacgga 
cttcggccat 
tccctcacgt 
gccaggtaca 
ccctccacgg 
ctggcaaggt 
gggcattgcg 
tgtggcgtcc 
ccctttcccc 
gctggcggtg 
tcatctgttg 
gccccagggg 
aaccccctgg 
gctaccttat 
acagtgcact 
ctact 



ctccacttgg 
ctttcggagg 
cggttacatc 
gctcacctgc 
tggtaattac 
accagcagcg 
gctgagcgac 
aacgcgtatg 
aggcctgcgg 
cccctatatg 
gttggggttc 
cggcaaggtc 
tctccccgct 
cagggtcaaa 
agcgtaatcc 
gtgatctgaa 



ctagctgagt 
cccctcgctc 
ctcctatacc 
gcgtttaggg 
tacaagtttc 
ttgagggtta 
ctcaagctcc 
ctccggtcgc 
cttccccctc 
ggggtggttc 
ttagccctgc 
cggtgactga 
gggtaaaaag 
gcctgatggt 
gtgactacgg 
ggggtgcacc 



8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9395 



<210> 2 
<211> 2910 
<212> PRT 

<213> Hepatitis G virus 
<400> 2 

Met Ser Leu Leu Thr Asn Arg Phe lie Arg Arg Val Asp Lys Asp Gin 
15 10 15 

Trp Gly Pro Gly Val Met Gly Lys Asp Pro Lys Pro Cys Pro Ser Arg 
20 25 30 

Trp Ala Gly Lys Cys Met Gly Pro Pro Ser Ser Ala Ala Ala Cys Ser 
35 40 45 

Arg Gly Ser Pro Arg lie Leu Arg Val Arg Ala Gly Gly lie Ser Leu 
50 55 60 

Phe Tyr Thr lie Met Ala Val Leu Leu Leu Leu Leu Val Val Glu Ala 
65 70 75 80 

Gly Ala lie Leu Ala Pro Ala Thr His Ala Cys Arg Ala Asn Gly Gin 
85 90 95 

Tyr Phe Leu Thr Asn Cys Cys Ala Pro Glu Asp lie Gly Phe Cys Leu 
100 105 110 

Glu Gly Gly Cys Leu Val Ala Leu Gly Cys Thr Val Cys Thr Asp Arg 
115 120 125 

Cys Trp Pro Leu Tyr Gin Ala Gly Leu Ala Val Arg Pro Gly Lys Ser 
130 135 140 

Ala Ala Gin Leu Val Gly Glu Leu Gly Ser Leu Tyr Gly Pro Leu Ser 
145 150 155 160 



Val Ser Ala Tyr Val Ala Gly lie Leu Gly Leu Gly Glu Val Tyr Ser 
165 170 175 



Gly Val Leu Thr Val Gly Val Ala Leu Arg Arg Arg Val Tyr Leu Met 
180 185 190 



Pro Asn Leu Lys Cys Ala Val Glu Cys Asp Val Lys Trp Gly Ser Glu 
195 200 205 

Phe Trp Arg Trp Thr Glu Gin Leu Ala Ser Asn Tyr Trp lie Leu Glu 
210 215 220 

Tyr Leu Trp Lys Val Pro Phe Glu Phe Trp Arg Gly Val Met Ser Leu 
225 230 235 240 

Thr Pro Leu Leu Val Trp Val Ala Ala Leu Leu Leu Leu Glu Gin Arg 
245 250 255 

He Val Met Val Phe Leu Leu Val Thr Met Ala Gly Met Leu Gin Gly 
260 265 270 

Ala Pro Ala Ser Val Leu Gly Ser Arg Pro Phe Asp Tyr Gly Leu Lys 
£3 275 280 285 

"\ 

£3 Trp Gin Ser Cys Ser Cys Arg Ala Asn Gly Ser Arg He Pro Thr Gly 
290 295 300 

in 

Glu Arg Val Trp Asp Arg Gly Asn Val Thr Leu Leu Cys Asp Cys Pro 
305 310 315 320 



^ Asn Gly Pro Trp Val Trp Val Pro Ala Phe Cys Gin Ala Val Gly Trp 
s 325 330 335 

=S Gly Asp Pro He Thr His Trp Ser His Gly Gin Asn Gin Trp Pro Leu 
p 340 345 350 

?=i Ser Cys Pro Gin Tyr Val Tyr Gly Ser Val Ser Val Thr Cys Val Trp 
355 360 365 

Gly Ser Val Ser Trp Phe Ala Ser Thr Gly Gly Arg Asp Ser Lys He 
370 375 380 

Asp Val Trp Ser Leu Val Pro Val Gly Ser Ala Ser Cys Thr He Ala 
385 390 395 400 

Ala Leu Gly Ser Ser Asp Arg Asp Thr Val Val Glu Leu Ser Glu Trp 
405 410 415 

Gly Val Pro Cys Val Thr Cys He Leu Asp Arg Arg Pro Ala Ser Cys 
420 425 430 

Gly Thr Cys Val Arg Asp Cys Trp Pro Glu Thr Gly Ser Val Arg Phe 
435 440 445 

Pro Phe His Arg Cys Gly Thr Gly Pro Arg Leu Thr Lys Asp Leu Glu 
450 455 460 

Ala Val Pro Phe Val Asn Arg Thr Thr Pro Phe Thr He Arg Gly Pro 
465 470 475 480 
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Leu Gly Asn Gin Gly Arg Gly Asn Pro Val Arg Ser Pro Leu Gly Phe 
485 490 495 



Gly Ser Tyr Thr Met Thr Lys lie Arg Asp Ser Leu His Leu Val Lys 
500 505 510 

Cys Pro Thr Pro Ala lie Glu Pro Pro Thr Gly Thr Phe Gly Phe Phe 
515 520 525 

Pro Gly Val Pro Pro lie Asn Asn Cys Met Pro Leu Gly Thr Glu Val 
530 535 540 

Ser Glu Ala Leu Gly Gly Ala Gly Leu Thr Gly Gly Phe Tyr Glu Pro 
545 550 555 560 

Leu Val Arg Arg Cys Ser Glu Leu Met Gly Arg Arg Asn Pro Val Cys 
565 570 575 

Pro Gly Tyr Ala Trp Leu Ser Ser Gly Arg Pro Asp Gly Phe lie His 
fl 580 585 590 

Val Gin Gly His Leu Gin Glu Val Asp Ala Gly Asn Phe lie Pro Pro 

pfj 595 600 605 

; is? 

b J Pro Arg Trp Leu Leu Leu Asp Phe Val Phe Val Leu Leu Tyr Leu Met 
610 615 620 



[0 Lys Leu Ala Glu Ala Arg Leu Val Pro Leu lie Leu Leu Leu Leu Trp 
g 625 630 635 640 

C3 

j Trp Trp Val Asn Gin Leu Ala Val Leu Gly Leu Pro Ala Val Asp Ala 

645 650 655 



P 
Iff 



I'S, Ala Val Ala Gly Glu Val Phe Ala Gly Pro Ala Leu Ser Trp Cys Leu 

660 665 670 

Gly Leu Pro Thr Val Ser Met lie Leu Gly Leu Ala Asn Leu Val Leu 
675 680 685 



Tyr Phe Arg Trp Met Gly Pro Gin Arg Leu Met Phe Leu Val Leu Trp 
690 695 700 

Lys Leu Ala Arg Gly Ala Phe Pro Leu Ala Leu Leu Met Gly lie Ser 
705 710 715 720 

Ala Thr Arg Gly Arg Thr Ser Val Leu Gly Ala Glu Phe Cys Phe Asp 
725 730 735 

Val Thr Phe Glu Val Asp Thr Ser Val Leu Gly Trp Val Val Ala Ser 
740 745 750 

Val Val Ala Trp Ala He Ala Leu Leu Ser Ser Met Ser Ala Gly Gly 
755 760 765 



Trp Lys His Lys Ala Val He Tyr Arg Thr Trp Cys Lys Gly Tyr Gin 
770 775 780 



6 



Ala Val Arg Gin Arg Val Val Arg Ser Pro Leu Gly Glu Gly Arg Pro 
785 790 795 800 



Thr Lys Leu Leu Thr Phe Ala Trp Cys Leu Ala Ser Tyr lie Trp Pro 
805 810 815 

Asp Ala Val Met Met Val Val Val Ala Leu Val Leu Leu Phe Gly Leu 
820 825 830 

Phe Asp Ala Leu Asp Trp Ala Leu Glu Glu Leu Leu Val Ser Arg Pro 
835 840 845 

Ser Leu Arg Arg Leu Ala Arg Val Val Glu Cys Cys Val Met Ala Gly 
850 855 860 

Glu Lys Ala Thr Thr lie Arg Leu Val Ser Lys Met Cys Ala Arg Gly 
865 870 875 880 

Ala Tyr Leu Phe Asp His Met Gly Ser Phe Ser Arg Ala Val Lys Glu 
885 890 895 

Arg Leu Leu Glu Trp Asp Ala Ala Leu Glu Pro Leu Ser Phe Thr Arg 
900 905 910 

Thr Asp Cys Arg lie lie Arg Asp Ala Ala Arg Thr Leu Ser Cys Gly 
915 920 925 

Gin Cys Val Met Gly Leu Pro Val Val Ala Arg Arg Gly Asp Glu Val 
930 935 940 

Leu lie Gly Val Phe Gin Asp Val Asn His Leu Pro Pro Gly Phe Val 
945 950 955 960 

Pro Thr Ala Pro Val Val lie Arg Arg Cys Gly Lys Gly Phe Leu Gly 
965 970 975 

Val Thr Lys Ala Ala Leu Thr Gly Arg Asp Pro Asp Leu His Pro Gly 
980 985 990 

Asn Val Met Val Leu Gly Thr Ala Thr Ser Arg Ser Met Gly Thr Cys 
995 1000 1005 

Leu Asn Gly Leu Leu Phe Thr Thr Phe His Gly Ala Ser Ser Arg Thr 
1010 1015 1020 

lie Ala Thr Pro Val Gly Ala Leu Asn Pro Arg Trp Trp Ser Ala Ser 
1025 1030 1035 1040 

Asp Asp Val Thr Val Tyr Pro Leu Pro Asp Gly Ala Thr Ser Leu Thr 
1045 1050 1055 

Pro Cys Thr Cys Gin Ala Glu Ser Cys Trp Val lie Arg Ser Asp Gly 
1060 1065 1070 



Ala Leu Cys His Gly Leu Ser Lys Gly Asp Lys Val Glu Leu Asp Val 
1075 1080 1085 



Ala Met Glu Val Ser Asp Phe Arg Gly Ser Ser Gly Ser Pro Val Leu 
1090 1095 1100 



Cys Asp Lys Gly His Ala Val Arg Met Leu Val Ser Val Leu His Ser 
1105 1110 1115 1120 

Gly Gly Arg Val Thr Ala Ala Arg Phe Thr Arg Pro Trp Thr Gin Val 
1125 1130 1135 

Pro Thr Asp Ala Lys Thr Thr Thr Glu Pro Pro Pro Val Pro Ala Lys 
1140 1145 1150 

Gly Val Phe Lys Glu Ala Pro Leu Phe Met Pro Thr Gly Ala Gly Lys 
1155 1160 1165 

Ser Thr Arg Val Pro Leu Glu Tyr Gly Asn Met Gly His Lys Val Leu 
1170 1175 1180 

lie Leu Asn Pro Ser Val Ala Thr Val Arg Ala Met Gly Pro Tyr Met 
1185 1190 1195 1200 

Glu Arg Leu Ala Gly Lys His Pro Ser lie Tyr Cys Gly His Asp Thr 
1205 1210 1215 

Thr Ala Phe Thr Arg lie Thr Asp Ser Pro Leu Thr Tyr Ser Thr Tyr 
1220 1225 1230 

Gly Arg Phe Leu Ala Asn Pro Arg Gin Met Leu Arg Gly Val Ser Val 
1235 1240 1245 

Val lie Cys Asp Glu Cys His Ser His Asp Ser Thr Val Leu Leu Gly 
1250 1255 1260 

lie Gly Arg Val Arg Glu Leu Ala Arg Gly Cys Gly Val Gin Leu Val 
1265 1270 1275 1280 

Leu Tyr Ala Thr Ala Thr Pro Pro Gly Ser Pro Met Thr Gin His Pro 
1285 1290 1295 

Ser lie lie Glu Thr Lys Leu Asp Val Gly Glu lie Pro Phe Tyr Gly 
1300 1305 1310 

His Gly lie Pro Leu Glu Arg Met Arg Thr Gly Arg His Leu Val Phe 
1315 1320 1325 

Cys His Ser Lys Ala Glu Cys Glu Arg Leu Ala Gly Gin Phe Ser Ala 
1330 1335 1340 

Arg Gly Val Asn Ala lie Ala Tyr Tyr Arg Gly Lys Asp Ser Ser lie 
1345 1350 1355 1360 

lie Lys Asp Gly Asp Leu Val Val Cys Ala Thr Asp Ala Leu Ser Thr 
1365 1370 1375 

Gly Tyr Thr Gly Asn Phe Asp Ser Val Thr Asp Cys Gly Leu Val Val 
1380 1385 1390 
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Glu Glu Val Val Glu Val Thr Leu Asp Pro Thr He Thr He Ser Leu 
1395 1400 1405 



Arg Thr Val Pro Ala Ser Ala Glu Leu Ser Met Gin Arg Arg Gly Arg 
1410 1415 1420 

Thr Gly Arg Gly Arg Ser Gly Arg Tyr Tyr Tyr Ala Gly Val Gly Lys 
1425 1430 1435 1440 

Ala Pro Ala Gly Val Val Arg Ser Gly Pro Val Trp Ser Ala Val Glu 
1445 1450 1455 

Ala Gly Val Thr Trp Tyr Gly Met Glu Pro Asp Leu Thr Ala Asn Leu 
1460 1465 1470 

Leu Arg Leu Tyr Asp Asn Cys Pro Tyr Thr Ala Ala Val Ala Ala Asp 
1475 1480 1485 

He Gly Glu Ala Ala Val Phe Phe Ser Gly Leu Ala Pro Leu Arg Met 
1490 1495 1500 

His Pro Asp Val Ser Trp Ala Lys Val Arg Gly Val Asn Trp Pro Phe 
1505 1510 1515 1520 

Leu Val Gly Val Gin Arg Thr Met Cys Arg Glu Thr Leu Ser Pro Gly 
1525 1530 1535 

Pro Ser Asp Asp Pro Gin Trp Ala Gly Leu Lys Gly Pro Asn Pro Val 
1540 1545 1550 

Pro Leu Leu Leu Arg Trp Gly Asn Asp Leu Pro Ser Lys Val Ala Gly 
1555 1560 1565 

His His He Val Asp Asp Leu Val Arg Arg Leu Gly Val Ala Glu Gly 
1570 1575 1580 

Tyr Val Arg Cys Asp Ala Gly Pro He Leu Met Val Gly Leu Ala He 
1585 1590 1595 1600 

Ala Gly Gly Met He Tyr Ala Ser Tyr Thr Gly Ser Leu Val Val Val 
1605 1610 1615 

Thr Asp Trp Asp Val Lys Gly Gly Gly Ser Pro Leu Tyr Arg His Gly 
1620 1625 1630 

Asp Gin Ala Thr Pro Gin Pro Val Val Gin Val Pro Pro Val Asp His 
1635 1640 1645 

Arg Pro Gly Gly Glu Ser Ala Pro Ser Asp Ala Asn Thr Val Thr Asp 
1650 1655 1660 

Ala Val Ala Ala He Gin Val Asp Cys Asp Trp Ser Val Met Thr Leu 
1665 1670 1675 1680 



Ser He Gly Glu Val Leu Ser Leu Ala Gin Ala Lys Thr Ala Glu Ala 
1685 1690 1695 



Tyr Ala Ala Thr Thr Lys Trp Leu Ala Gly Cys Tyr Thr Gly Thr Arg 
1700 1705 1710 



Ala Val Pro Thr Val Ser lie Val Asp Lys Leu Phe Ala Gly Gly Trp 
1715 1720 1725 

Ala Ala Val Val Gly His Cys His Ser Val He Ala Ala Ala Val Ala 
1730 1735 1740 

Ala Tyr Gly Ala Ser Arg Ser Pro Pro Leu Ala Ala Ala Ala Ser Tyr 
1745 1750 1755 1760 

Leu Met Gly Leu Gly Val Gly Gly Asn Ala Gin Thr Arg Leu Ala Ser 
1765 1770 1775 

Ala Leu Leu Leu Gly Ala Ala Gly Thr Ala Leu Gly Thr Pro Val Val 
1780 1785 1790 

Gly Leu Thr Met Ala Gly Ala Phe Met Gly Ser Ala Ser Val Ser Pro 
1795 1800 1805 

Ser Leu Val Thr He Leu Leu Gly Ala Val Gly Gly Trp Glu Gly Val 
1810 1815 1820 

Val Asn Ala Ala Ser Leu Val Phe Asp Phe Met Ala Gly Lys Leu Ser 
1825 1830 1835 1840 

Ser Glu Asp Leu Trp Tyr Ala He Pro Val Leu Thr Ser Pro Gly Ala 
1845 1850 1855 

Gly Leu Ala Gly He Ala Leu Gly Leu Val Leu Tyr Ser Ala Asn Asn 
1860 1865 1870 

Ser Gly Thr Thr Thr Trp Leu Asn Arg Leu Leu Thr Thr Leu Pro Arg 
1875 1880 1885 

Ser Ser Cys He Pro Asp Ser Tyr Phe Gin Gin Ala Asp Tyr Cys Asp 
1890 1895 1900 

Lys Val Ser Ala Val Leu Arg Arg Leu Ser Leu Thr Arg Thr Val Val 
1905 1910 1915 1920 

Ala Leu Val Asn Arg Glu Pro Lys Val Asp Glu Val Gin Val Gly Tyr 
1925 1930 1935 

Val Trp Asp Leu Trp Glu Trp He Met Arg Gin Val Arg Met Val Met 
1940 1945 1950 

Ala Arg Leu Arg Ala Leu Cys Pro Val Val Ser Leu Pro Leu Trp His 
1955 1960 1965 

Cys Gly Glu Gly Trp Ser Gly Glu Trp Leu Leu Asp Gly His Val Glu 
1970 1975 1980 

Ser Arg Cys Leu Cys Gly Cys Val He Thr Gly Asp Val Leu Asn Gly 
1985 1990 1995 2000 
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Gin Leu Lys Asp Pro Val Tyr Ser Thr Lys Leu Cys Arg His Tyr Trp 
2005 2010 2015 

Met Gly Thr Val Pro Val Asn Met Leu Gly Tyr Gly Glu Thr Ser Pro 
2020 2025 2030 

Leu Leu Ala Ser Asp Thr Pro Lys Val Val Pro Phe Gly Thr Ser Gly 
2035 2040 2045 

Trp Ala Glu Val Val Val Thr Pro Thr His Val Val He Arg Arg Thr 
2050 2055 2060 

Ser Ala Tyr Lys Leu Leu Arg Gin Gin He Leu Ser Ala Ala Val Ala 
2065 2070 2075 2080 

Glu Pro Tyr Tyr Val Asp Gly He Pro Val Ser Trp Asp Ala Asp Ala 
2085 2090 2095 

Arg Ala Pro Ala Met Val Tyr Gly Pro Gly Gin Ser Val Thr He Asp 
P 2100 2105 2110 

1 1 

^ Gly Glu Arg Tyr Thr Leu Pro His Gin Leu Arg Leu Arg Asn Val Ala 
2115 2120 2125 

Ti : 
E ~? 

SO Pro Ser Glu Val Ser Ser Glu Val Ser He Asp He Gly Thr Glu Thr 
= = 2130 2135 2140 

□ 

Iq Glu Asp Ser Glu Leu Thr Glu Ala Asp Leu Pro Pro Ala Ala Ala Ala 
I 2145 2150 2155 2160 

C3 

== Leu Gin Ala He Glu Asn Ala Ala Arg He Leu Glu Pro His lie Asp 
lZ 2165 2170 2175 

u 

Val He Met Glu Asp Cys Ser Thr Pro Ser Leu Cys Gly Ser Ser Arg 
12 2180 2185 2190 

s . 

h r: 
i 

Glu Met Pro Val Trp Gly Glu Asp He Pro Arg Thr Pro Ser Pro Ala 
2195 2200 2205 

Leu He Ser Val Thr Glu Ser Ser Pro Asp Glu Lys Thr Pro Ser Val 
2210 2215 2220 

Ser Ser Ser Gin Glu Asp Thr Pro Ser Ser Asp Ser Phe Glu Val He 
2225 2230 2235 2240 

Gin Glu Ser Glu Thr Ala Glu Gly Glu Glu Ser Val Phe Asn Val Ala 
2245 2250 2255 

Leu Ser Val Leu Lys Ala Leu Phe Pro Gin Ser Asp Ala Thr Arg Lys 
2260 2265 2270 

Leu Thr Val Lys Met Ser Cys Cys Val Glu Lys Ser Val Thr Arg Phe 
2275 2280 2285 

Phe Ser Leu Gly Leu Thr Val Ala Asp Val Ala Ser Leu Cys Glu Met 
2290 2295 2300 
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Glu lie Gin Asn His Thr Ala Tyr Cys Asp Lys Val Arg Thr Pro Leu 
2305 2310 2315 2320 

Glu Leu Gin Val Gly Cys Leu Val Gly Asn Glu Leu Thr Phe Glu Cys 
2325 2330 2335 

Asp Lys Cys Glu Ala Arg Gin Glu Thr Leu Ala Ser Phe Ser Tyr lie 
2340 2345 2350 

Trp Ser Gly Val Pro Leu Thr Arg Ala Thr Pro Ala Lys Pro Pro Val 
2355 2360 2365 

Val Arg Pro Val Gly Ser Leu Leu Val Ala Asp Thr Thr Lys Val Tyr 
2370 2375 2380 

Val Thr Asn Pro Asp Asn Val Gly Arg Arg Val Asp Lys Val Thr Phe 
2385 2390 2395 2400 

Trp Arg Ala Pro Arg Val His Asp Lys Phe Leu Val Asp Ser lie Glu 
p 2405 2410 2415 

?. "1 

fS Arg Ala Lys Arg Ala Ala Gin Ala Cys Leu Ser Met Gly Tyr Thr Tyr 
?j1 2420 2425 2430 

' fc if; Glu Glu Ala lie Arg Thr Val Arg Pro His Ala Ala Met Gly Trp Gly 
" 2435 2440 2445 

w 

C9 Ser Lys Val Ser Val Lys Asp Leu Ala Thr Pro Ala Gly Lys Met Ala 
s 2450 2455 2460 

r "*| 

~f Val His Asp Arg Leu Gin Glu lie Leu Glu Gly Thr Pro Val Pro Phe 
?~ 2465 2470 2475 2480 

i n 

!-j Thr Leu Thr Val Lys Lys Glu Val Phe Phe Lys Asp Arg Lys Glu Glu 
2485 2490 2495 

Lys Ala Pro Arg Leu lie Val Phe Pro Pro Leu Asp Phe Arg lie Ala 
2500 2505 2510 

Glu Lys Leu lie Leu Gly Asp Pro Gly Arg Val Ala Lys Ala Val Leu 
2515 2520 2525 

Gly Gly Ala Tyr Ala Phe Gin Tyr Thr Pro Asn Gin Arg lie Arg Glu 
2530 2535 2540 

Met Leu Lys Leu Trp Glu Ser Lys Lys Thr Pro Cys Ala lie Cys Val 
2545 2550 2555 2560 

Asp Ala Thr Cys Phe Asp Ser Ser lie Thr Glu Glu Asp Val Ala Leu 
2565 2570 2575 

Glu Thr Glu Leu Tyr Ala Leu Ala Ser Asp His Pro Glu Trp Val Arg 
2580 2585 2590 

Ala Leu Gly Lys Tyr Tyr Ala Ser Gly Thr Met Val Thr Pro Glu Gly 
2595 2600 2605 
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Val Pro Val Gly Glu Arg Tyr Cys Arg Ser Ser Gly Val Leu Thr Thr 
2610 2615 2620 

Ser Ala Ser Asn Cys Leu Thr Cys Tyr lie Lys Val Lys Ala Ala Cys 
2625 2630 2635 2640 

Glu Arg Val Gly Leu Lys Asn Val Ser Leu Leu lie Ala Gly Asp Asp 
2645 2650 2655 

Cys Leu lie lie Cys Glu Arg Pro Val Cys Asp Pro Ser Asp Ala Leu 
2660 2665 2670 

Gly Arg Ala Leu Ala Ser Tyr Gly Tyr Ala Cys Glu Pro Ser Tyr His 
2675 2680 2685 

Ala Ser Leu Asp Thr Ala Pro Phe Cys Ser Thr Trp Leu Ala Glu Cys 
2690 2695 2700 

Asn Ala Asp Gly Lys Arg His Phe Phe Leu Thr Thr Asp Phe Arg Arg 
□ 2705 2710 2715 2720 

frj Pro Leu Ala Arg Met Ser Ser Glu Tyr Ser Asp Pro Met Ala Ser Ala 
fll 2725 2730 2735 

*J lie Gly Tyr lie Leu Leu Tyr Pro Trp His Pro lie Thr Arg Trp Val 
*2 2740 2745 2750 

I Ji 

CO lie He Pro His Val Leu Thr Cys Ala Phe Arg Gly Gly Gly Thr Pro 
e 2755 2760 2765 

H ; Ser Asp Pro Val Trp Cys Gin Val His Gly Asn Tyr Tyr Lys Phe Pro 
fi 2770 2775 2780 

in 

12 Leu Asp Lys Leu Pro Asn He He Val Ala Leu His Gly Pro Ala Ala 
H 2785 2790 2795 2800 

E X 
? 

Leu Arg Val Thr Ala Asp Thr Thr Lys Thr Lys Met Glu Ala Gly Lys 
2805 2810 2815 

Val Leu Ser Asp Leu Lys Leu Pro Gly Leu Ala Val His Arg Lys Lys 
2820 2825 2830 

Ala Gly Ala Leu Arg Thr Arg Met Leu Arg Ser Arg Gly Trp Ala Glu 
2835 2840 2845 

Leu Ala Arg Gly Leu Leu Trp Arg Pro Gly Leu Arg Leu Pro Pro Pro 
2850 2855 2860 

Glu He Ala Gly He Pro Gly Gly Phe Pro Leu Ser Pro Pro Tyr Met 
2865 2870 2875 2880 

Gly Val Val His Gin Leu Asp Phe Thr Ser Gin Arg Ser Arg Trp Arg 
2885 2890 2895 

Trp Leu Gly Phe Leu Ala Leu Leu He Val Ala Leu Phe Gly 
2900 2905 2910 
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